TREES, RNA SECONDARY STRUCTURES AND CLUSTER 

COMBINATORICS 



ROBERT J. MARSH AND SIBYLLE SCHROLL 

Abstract. We develop and study the structure of combinatorial objects that 
are a special case of RNA secondary structures. These are generalisations of 
objects arising from interval exchange transformations generalising those in 
the Sturmian context. We represent them as labelled edge-coloured trees. We 
classify and enumerate them and show that a natural subset is in bijection 
with a set of m-clusters. Furthermore, we interpret a notion of induction used 
to model generalised interval exchange transformations as a composition of 
cluster mutations. 



1. Introduction 

One of the earliest occurrences of Sturmian words is in a number-theoretic con- 
text, in a paper from 1882 by Markov [23] in which he answers a question of Jean 
Bernoulli. Half a century later, in the early stages of symbolic dynamics, Morse and 
Hedlund introduced the term Sturmian. Sturmian words form a typical example of 
a minimal set of a dynamical system; this led to a systematic study of their many 
combinatorial properties. A natural question is how to generalise these special lan- 
guages to wider classes with less strict combinatorial properties. Sturmian words 
correspond to interval exchange transformations of two intervals, and one gener- 
alisation of Sturmian languages arises from interval exchange transformations of 
several intervals; see [5]. The combinatorial properties of the associated languages, 
which were studied in [2], motivated this article. 

Another natural generalisation is to consider higher dimensional analogues of 
interval exchange transformations. The languages arising from this will have a 
more complex combinatorial structure and it is not immediately obvious what they 
should look like. In this paper we have taken a complementary approach, gen- 
eralising the underlying combinatorics and studying it in detail. This generates 
interesting combinatorics in its own right. It also opens up a possible route to in- 
troducing and developing combinatorial languages for higher dimensional exchange 
transformations. In a future paper [23] . currently in preparation, we are taking this 
approach. 

On the other hand, we show that the combinatorics we develop here has a con- 
nection with cluster algebras. Cluster algebras were introduced by Fomin and 
Zelevinsky in [12] in order to model the properties of the dual canonical basis (or 
global crystal basis) of a quantized enveloping algebra and to study total posi- 
tivity. As well as bringing greater understanding to these fields, cluster algebras 
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have been applied with much success in the fields of combinatorics, representation 
theory and related category theory, as well as other fields. See, for example, the 
surveys [TT 1 I22 ] . 

In particular, cluster algebras led to the introduction [13j [14] of a natural family 
of simplicial complexes, known as generalized associahedra or cluster complexes, 
classified by the Dynkin diagrams. The type A case corresponds to the Stasheff 
polytope. Corresponding polytopes for every Dynkin diagram were constructed 
in [3] . This family of complexes was generalised further in [TU] to give the generalised 
cluster complexes. By giving natural bijections, we show how the combinatorial 
objects discussed here relate closely to the generalised cluster complexes of type A. 

We now give a more detailed description of the contents of the paper. Given 
natural numbers k, m we consider the set of labelled m-edge-coloured trees with k 
vertices, i.e. trees with k vertices, labelled with the numbers 1,2, ... ,k, with each 
edge coloured with one of m possible symbols Si, S?, . . . , S m in such a way that the 
edges incident with a given vertex all have different colours (see [33] 5.28, p81]); 
see Figure SJc) for an example. Such trees have a natural cyclic ordering on their 
vertices induced by the edge-labelling which we refer to as the circular order. 

We show (Section 7) that a form of generalised induction on these trees exists 
(taking such a tree to another such tree) and that the equivalence classes of its 
transitive closure are characterised by the circular order. Thus the set of equivalence 
classes is in bijcction with the set of fc-cycles in the symmetric group of degree k. 

The case m = 3 was studied in [2J, where the trees we study here were referred 
to as trees of relations. Furthermore, it was shown in [5] that the induction in the 
case m = 3 induces languages naturally generalising Sturmian languages, arising 
from interval exchange transformations associated to permutations. 

We also consider RNA secondary structures (in the sense of Waterman [55] ; 
see also [35]) on the sequence (S1S2 ■ ■ ■ S m ) k with the rule that pairings can only 
be made when the bases at the end-points are equal. Such a secondary structure 
consists of k vertices, 1,2, ... ,k, arranged clockwise around a circle, with m bases 
given by the m symbols Si, S2, ■ ■ ■ , S m arranged clockwise at each vertex. There is 
a collection of arcs linking equal bases (with no symbol incident to more than one 
arc). We call such an RNA secondary structure an RNA m- diagram of degree k. 

An RNA m-diagram is said to be noncrossing if it can be drawn in such a way 
that no two arcs cross. We say that it is connected if every vertex is connected to 
every other vertex via a sequence of arcs, regarding all of the bases at any given 
vertex as being connected together. For an example, see Figure Ufa). Diagrams of 
this form, for the case m = 3, were considered in [2J. In Section 2 we give a detailed 
description of the combinatorial objects we are considering in this article. 

We first show that the set of labelled m-edge-coloured trees with k vertices and 
circular order (12 • ■ • k) (an induction equivalence class) is in bijection with the 
set of connected noncrossing RNA m-diagrams of degree k. We then compute the 
cardinality of each of these sets in three different ways. The first method (Section 

3) involves a bijection with the set of rooted diagonal-coloured m-angulations of a 
regular (m — 2)k + 2-sided polygon, up to rotation. The second method (Section 

4) involves a recursion formula along the lines of |351 Thm. 2.1]. The third method 
(Section 5) involves showing that the sequence whose fcth term is the number of 
connected noncrossing RNA m-diagrams with k vertices is the m-fold convolution 
of the Fuss- Catalan sequence of degree m — 1. This can be regarded as giving an 
alternative proof of a formula for this term (which is given in [20, §3]). 

Since the number of induction equivalence classes of labelled m-edge coloured 
trees with k vertices is equal to the number of fc-cycles in the symmetric group of 
degree k, and each equivalence class has the same cardinality, we obtain a new way 
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(Section 6) to count the labelled m-edge-coloured trees on k vertices (this number 
is known: see [3J 5.28, pl24]). 

As mentioned above, in Section 7 we study the generalised induction on labelled 
m-edge coloured trees with k vertices. Finally, in Section 8, we give a description 
of the generalised induction defined here in terms of (m — 2)-cluster mutation. 

2. Some combinatorial objects 

We firstly introduce the main combinatorial objects we will be considering in 
this article: edge-coloured trees, RNA m-diagrams, and m-angulations. 

2.1. Trees. As mentioned above, given positive integers k, m, we will consider 
labelled m-edge-coloured trees on k vertices, i.e. trees whose vertices are labelled 
1,2, . . . , k and whose edges are coloured with one of the m symbols Si, S2, . . . , S rn 
in such a way that no two edges incident with the same vertex have the same colour. 
We will also consider m-edge coloured trees on k vertices (i.e. without any labelling 
on the vertices). Note that either kind of tree may be rooted, i.e. come together 
with a distinguished vertex. As is usual, if the connectedness assumption is not 
satisfied, we refer to the corresponding objects as forests. See Figure HJb) for an 
example of a (rooted) m-edge-coloured tree and (c) for an example of a labelled 
m-edge-coloured tree. 

Given an m-edge coloured tree S with k vertices, each symbol S r determines a 
map (with the same name) from the set of vertices of iS to itself. A vertex is fixed 
by S r unless it is incident with an edge coloured S r , in which case it is sent to the 
vertex at the other end of that edge. Let as be the composition S m S m -i ■ ■ ■ Si: 
this is a permutation of the vertices of S. We use the same definition for a labelled 
m-edge coloured tree, G, obtaining a permutation <jq in the symmetric group of 
degree k. 

We refer to as (respectively, ctg) as the circular order of S (respectively, G). 
These maps (for 3 symbols) were considered in [2] . We shall see later (see Lemma ltTTj) 
that the circular order is always a fc-cycle. 

2.2. RNA secondary structure. The primary structure of an RNA molecule is 
a linear sequence of nucleotides, each containing a base of one of four types, labelled 
A (adenine), C (cytosine), G (guanine), or U (uracil). The bases may form pairs 
and such pairing is known as the secondary structure of the RNA (see e.g. |31|V 

This secondary structure is represented by a diagram defined by Waterman in [3 5) 
in a key paper initiating the combinatorial study of RNA secondary structure. See 
also [28l Chapter 2]. The diagram is a graph with vertices 1,2, ... , fc (corresponding 
to the bases) such that the valency of each vertex is at most 1, represented in the 
plane by drawing the vertices 1, 2, . . . , k in a horizontal line. A pairing between the 
bases at vertices i and j, with i < j, is represented by an arc (i, j); its length is 
said to be j — i. The number k is called the length of the diagram. For an example, 
see Figure [TJ 

A crossing is a pair (u,ji), ^2^2) of distinct arcs satisfying i\ < i2 < ji < J2, 
and a diagram is said to be noncrossing (or pseudoknot-free) if it contains no 
crossings. 

Diagrams are often drawn on a circle, using the Nussinov circle representa- 
tion [25]; see Figured! 

The combinatorics of the secondary structure of RNA was initially developed by 
Waterman and his collaborators; see [35J ISS] an d e.g. [37J for a survey of some later 
work. Others have worked on the subject, e.g. [18| . More recently the combinatorics 
has also been studied extensively by Reidys; see [2S] and references therein. 
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Figure 1. A diagram (RNA secondary structure) of length 11 and 
its Nussinov circle representation. 

We remark that RNA secondary structure is also interesting from a topologi- 
cal perspective; see e.g. [53]. Also, the article [27J counts modular diagrams (i.e. 
having no isolated arcs and with each arc of length at least 4) satisfying a certain 
noncrossing property, important for a particular model of RNA. RNA secondary 
structures can be represented using permutations; see |38j . 

Note that RNA secondary structures can also be considered in a more abstract 
setting, i.e. with an arbitrary finite set of possible bases and arbitrary pairing rules; 
see [35] §3] (or [TS] §5]). Here we consider such a setup in which there are m bases, 
Si, 5*2, . • • , S m . We take the sequence (S1S2 ■ ■ ■ S m ) , and the rule that pairings can 
only be made provided the bases are equal. We define an RNA m-diagram of degree 
A; to be a diagram in the above sense, drawn in the Nussinov circle representation, 
but keeping the bases as labels on the vertices. 

Thus there are k vertices 1,2, ... ,k, numbered clockwise in a circle. Each vertex 
contains the bases Si, S2, ■ ■ ■ , S m written in order clockwise, and a collection of arcs 
connect equal bases at different vertices. A base at a vertex is incident to at most 
one arc. Such a diagram is noncrossing if it can be drawn in such a way that there 
are no crossings between the arcs. We say that it is connected provided there is a 
path between any two vertices (where moving between bases at a vertex is allowed). 
See Figure[Ua) for an example of a connected noncrossing RNA 4-diagram of degree 
10. 

We note that the combinatorics of repeated RNA sequences is also considered 
in [T9] §3] but that a symmetry condition is imposed. We also remark that a 
bijection between a set of RNA secondary structures and a set of trees of a certain 
kind is given in [25], but this is not the same as the bijections considered here. 

We also remark that the article pQ defines Fuss-Catalan algebras, whose basis 
diagrams are the same as RNA m-diagrams except that every base is incident with 
exactly one arc, there is no connectedness condition and the ordering of the bases 
is reversed in neighbouring vertices except possibly the first and last vertex. In 
this case, as the name suggests, the number of basis diagrams coincides with a 
Fuss-Catalan number. 

If the bases at each vertex are all amalgamated into a single vertex, a noncrossing 
RNA m-diagram of degree k gives rise to a generalised version of a tangled diagram 
in the sense of 4, 5 , i.e. in which every vertex has valency at most m (instead of the 
usual restriction of 2). But note that not every tangled diagram arises in this way; 
in particular, the diagrams obtained in this way will have at most one arc between 
any given pair of vertices, no crossings, and no loops or cycles (see Theorem 13. ip . 
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The case we consider has its own combinatorics, which we will investigate here. 

2.3. m-angulations. Now let P be an dk + 2-sided regular polygon. A d-diagonal 
in P is a diagonal joining two vertices of P which divides P into an dj + 2-sided 
polygon and an d(n — j) + 2-sided polygon for some j where 1 < j < l" 2 *^]- A 
maximal collection of d-diagonals of P divides P into d + 2-sided polygons. Such 
dissections of P are referred to as d- divisible dissections in 34 . Taking d = m — 2, 
such dissections divide P into m-sided polygons, and we shall refer to them here as 
m-angulations of P. 

We say that an m-angulation of P is diagonal- coloured if each diagonal in P is 
coloured with a symbol from the set {Si, S%, . . . , S m } in such a way that the colours 
on the sides of each m-sided polygon in the m-angulation are Si, S2, ■ ■ ■ , S m in 
clockwise order. We say that it is rooted if there is a distinguished m-sided polygon 
in the m-angulation. We say that it is m-gon-labelled if the m-gons are labelled 
1,2, ... ,k. See Figure [5] for an example of a diagonal-coloured 4-angulation. 



3. BlJECTIONS 

Our main aim in this section is to compute the number of degree k connected 
noncrossing RNA m-diagrams. We shall also give a bijective proof that this is the 
same as the number of labelled m-edge-coloured trees with k vertices and circular 
order (k k — 1 • ■ ■ 1). This is a generalisation of the case m — 3 appearing in [2] 
Sect. 8]. 

Theorem 3.1. There is a bijection between the following sets: 
(I) The set of degree k noncrossing RNA m-diagrams. 

(II) The set of labelled m-edge-coloured forests G with k vertices such that, on 
writing G = LI^Gi as a union of connected components, we have the follow- 
ing: 

(a) If i 7^ j and 0,1,0,2 £ Gi, b\,b 2 £ Gj, we cannot have that a% > b\ > 
a 2 > b 2 . 

(b) If a £ Gi for some i, then o~g{o) is the maximal vertex of G less than 
a lying in Gi ( or, if no such vertex exists, it is the largest vertex of G 
lying in Gi). 

Proof. Let S be a noncrossing RNA m-diagram of degree k on m symbols as in (I) . 

Let G be the labelled m-edge-coloured graph with k vertices whose edges given 
by the arcs of E. That is, there is an edge between vertices i and j of G coloured 
with Sk if and only if there is an arc in E between the instances of the symbol Sk 
in vertices i and j in E. 

Claim: G is a forest. 

We prove the claim. Suppose, for a contradiction, there is a cycle 

Si 1 Si 2 *j>— 1 

ai <i2 • • • ip-i dp a\ 

in G, and thus a corresponding cycle in E. Without loss of generality, we may 
assume that i\ < «2- For vertices a, b, we denote by (a, b) the set of vertices c of E 
lying strictly clockwise of a and strictly anticlockwise of b. 

Then 03 £ (02,01), since the arc in E corresponding to the edge in G between 
a 2 and 03 cannot cross the arc in E corresponding to the edge in G between a\ and 
a 2 . 

By assumption on E, 12 7^ £3. If «2 > iz, there can be no path in E from 
the symbol Si 3 in vertex 03 of E back to vertex ai of E without crossings, a 
contradiction, hence 12 < 43. 
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Figure 2. A cycle in G leads to a crossing 

Repeating this argument, we see that, moving clockwise on E from a\ we meet 
vertices (Z2, (23, . . . , a v in order before returning to a\, and that i\ < 12 < ■ ■ ■ < i p . 
But then the arc between a p and 01 (on symbol Si crosses the arc between a\ and 
02 (on symbol Si 1 ) 1 since i\ < i p ; see Figure [2] Hence G has no cycles, and must 
be a tree. The claim is shown. 

We next prove that (a) holds. Suppose that i ^ j, a\,a% <E Gi, b\, 62 G Gj, and 
a± > bi > 02 > 62- Then 0i,&ij02,&2 follow each other anticlockwise around the 
circle. Then, since a±, ai are in the same connected component of G, there is a path 
in E from a\ to 02, and similarly from b\ to 62. The arrangement of a\, 0,2, b\ and 
62 implies that these two paths cross, a contradiction. Hence no such arrangement 
can occur, and (a) is shown. 

We next prove that (b) holds. It is enough to prove the following claim: 

Claim: Let a E Gi. Then aa(a) is the next vertex of Gi (considered as a vertex 
of E) anticlockwise on the circle from a. 

To prove the claim, we note that, by the definition of <jq, ac(a) is connected to 
a by a path in G: 

Si 1 Si 2 s ip-i 

a — ai a 2 • • • • • ■ a p — aa{a) , 

where i\ < 12 < ■ ■ ■ < i p —i. Furthermore, a\ is not incident with any arc with 
symbol S r for r < ii, a p is not incident with any arc with symbol S r for r > i p -\, 
and, for 2 < j < p — 1, a,j is not incident with any symbol S r for ij—i < r < ij. 

The existence of the above path implies that <jg{o) lies in Gi. Since i\ < 12 < 
■ ■ ■ < i p and there are no crossings, the path must go clockwise around the circle; 
see Figure [21 The conditions above and the fact there are no crossings imply that 
no vertex in (aci®), a) has an arc with a vertex in (a, aci®)) or with a or fjQ{a) 1 
so these vertices are connected only amongst themselves. It follows that they do 
not lie in Gi and we see that (II) (b) holds. Thus G is a labelled m-edge-coloured 
forest satisfying (II). 

Conversely, suppose that we have a labelled m-edge-coloured forest with k ver- 
tices satisfying (II). Let E be the RNA m-diagram of degree k with an arc joining 
S r in vertex i with S r in vertex j if and only if there is an edge in G between 
vertices i and j coloured with symbol S r . We must check that E can be drawn with 
no crossing arcs, i.e. that it is noncrossing. 
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Figure 3. The part of E between a and a G (a). 



We do this by induction on the number of vertices. Suppose first that G has 
more than one connected component, i.e. that G is not connected. By induction, 
each component Gi corresponds to a noncrossing RNA m-diagram (on the vertices 
of Gi). 

Suppose that we had a\ > a 2 € Gi and b\ > b 2 6 Gj for two distinct components 
Gi and Gj, with arcs between a\ and a 2 and b\ and b 2 which cross in E. Then, going 
around the circle anticlockwise, starting at vertex k, we must encounter 01, b±, a 2 , b 2 
in order, or b\, a\, b 2 , a 2 in order. Swapping Gi and Gj if necessary, we can assume 
we are in the first case. But then a\ > 61 > a 2 > b 2 , contradicting (II) (a). Hence 
E is noncrossing. 

So we are reduced to the case in which G has exactly one connected component, 
i.e. G is connected. Suppose that vertex k is incident with edges ei, e 2 , ■ ■ ■ , e<< in G, 
coloured with symbols S ri , S r2 , . . . , SV d where r\ < r 2 < . . . < r^. Let the endpoints 
of these edges (other than k) be v\, v 2 , . . . , v&. Removing vertex k from G leaves 
precisely d trees T\, T 2 , . . . , containing vertices v\, v 2 , . . . , Vd respectively. By (b), 
we know that oq — SmSm—x • • • Si induces the permutation (k k — 1 • • • 1) on the 
vertices of G. 

We apply era to vertex k, and then repeatedly apply era- By its definition, each 
application of aa corresponds to following a certain path through G, i.e. passing 
along the edges corresponding to the symbols in the sequence S\ , S 2 , ■ ■ ■ , S m in that 
order, when such incident edges exist. Since the edge e\ has symbol S ri , and no 
edge incident with k has smaller symbol, it follows that, after the first application 
of aa, we obtain vertex k\ := k — 1 in T\. 

Since a a is a fc-cycle, after repeated application of aa, we must leave T\. Suppose 
that k 2 is the number of the first vertex reached outside T± . Since r 2 is the minimum 
number of a symbol adjacent to k greater than n, k 2 will lie in T 2 . Repeating 
this argument, we will obtain k > k\ > k 2 > ■ ■ ■ > kd > 1 such that vertices 
ki + \ + 1, . . . , ki lie in tree for i = 1, 2, . . . , d— 1. At the final step, the first vertex 
reached on leaving Td must be k. Since aa is a fc-cycle, all vertices must have been 
visited. 

Let kd+i — 0. It follows from the above that tree Tj contains precisely vertices 
fcj+i + 1, . . . , ki for each i. Thus, the numbering of the vertices of G is first the 
vertices of Td in some order, then the vertices of Td-\ in some order, then the 
vertices of Td- 2 , and so on, ending with the vertices of T\ and then finally k. 
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Each Ti will correspond (by the inductive hypothesis) to a noncrossing RNA m- 
diagram on its vertices. Thus the vertices vi,V2, ■ ■ ■ ,Vd in G will be numbered in 
decreasing order. The arcs in S from k to these vertices are numbered by symbols 
S ri , S r2 , . . . , S rd , respectively, with r% < r^ < ■ ■ ■ < r^. It follows these arcs do not 
cross each other or any of the other arcs in E. See Figure Hl^a) for an example, 
where v± = 9, n = 1, and V2 — 8, T2 — 4. Hence, £ is noncrossing and thus an 
object in (I) as required. 

It is clear that the two maps we have constructed are inverse to each other, so 
the theorem is proved. □ 

The following lemma follows easily from the definitions. 

Lemma 3.2. A noncrossing RNA m-diagram is connected if and only if the cor- 
responding labelled m- edge- coloured forest is connected, i.e. is a tree. 

Remark 3.3. (1) Since a forest on k vertices is connected if and only if it 
has exactly k — 1 edges, a noncrossing RNA m-diagram with k vertices is 
connected if and only if it has k — 1 arcs. 
(2) In the connected case, the circular order of G is just the permutation (kk — 
1 ■ • • 1) and we have a bijection between the following sets: 

The set of connected noncrossing RNA m-diagrams of degree k. 

t 

The set of labelled m-edge-coloured trees with k vertices and circular 
order (k k — 1 • ■ ■ 1). 

The labelling on the vertices for a tree in the latter set is determined by a 
distinguished vertex, that labelled k, say, since oq then determines the labels on 
all the other vertices. We thus have: 

Corollary 3.4. The bijection in Theorem \3.1\ induces a bijection between the fol- 
lowing sets: 

(a) The set of connected noncrossing RNA m-diagrams of degree k. 

(b) The set of rooted m-edge-coloured trees with k vertices. 

(c) The set of labelled m-edge-coloured trees with k vertices and circular order 
(kk-1 ••• 1). 

See Figure IUa)-(c) f° r an illustration of this bijection, i.e. a particular connected 
noncrossing RNA m-diagram of degree 10 on 4 symbols and the corresponding 
objects from (b) and (c) above. 

Remark 3.5. Following [39l Sect. 5], define a noncrossing RNA m-diagram of 
degree k to be saturated if it is maximal in the sense that adding any further arcs 
would destroy the noncrossing property. Note that saturated RNA m-diagrams 
have also been studied in [7]. By Remark l3.3f l). a noncrossing RNA m-diagram is 
connected if and only if it has k — 1 arcs. Thus, if any more arcs were added, it 
would no longer be noncrossing. It follows that any noncrossing RNA m-diagram 
which is connected is necessarily saturated. Finally, we note that the noncrossing 
RNA 3-diagram of degree 5 in Figure is saturated but not connected. 

In what follows, we need a bijection between the set of diagonal-coloured m- 
angulations of an (m — 2)k + 2-sided regular polygon up to rotation and the set 
of m-edge-coloured trees with k vertices. Such a bijection in the unlabelled case 
(between m-angulations up to rotation and plane (m — l)-ary trees) is well-known; 
see, for example, [3H Sect. 6.2]. Here we need a labelled version, and for convenience 
we give some details of how this can be done. We also note that the case m = 3 
appears in [2"3"j . 
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(a) A connected noncrossing RNA 4-diagram of degree 10. 
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(b) A rooted 4-edge-coloured tree with 10 (c) A labelled 4-edge-coloured tree with 10 
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vertices and circular order (10 9 ■■■!). 



FIGURE 4. Objects corresponding to each other under the rejec- 
tions in Corollary [ 



Theorem 3.6. There is a bijection between the following sets: 

(a) The set of m- edge- coloured trees with k vertices. 

(b) The set of diagonal-coloured m-angulations of an (to — 2)k + 2-sided regular 
polygon up to rotation. 

Proof. Given a diagonal-coloured m-angulation of an (to — 2)k + 2-sided polygon, 
take the dual graph, with a vertex in the middle of each TO-sided polygon in the 
m-angulation and an edge between two vertices coloured Si whenever the corre- 
sponding polygons share an edge in the dissection coloured S{. Since the edges of 
a polygon in the TO-angulation all have distinct colours, the same is true for the 
edges in the dual graph incident with a given vertex. Since there are no internal 
vertices in the ?n-angulation, there can be no cycles in the dual graph. It is clearly 
connected, hence it is a tree. By construction, it is TO-edge-coloured and has k 
vertices. 
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Figure 5. A noncrossing RNA 3-diagram of degree 5 which is 
saturated but not connected. 



Conversely, given an m-edge-coloured tree with k vertices, first complete it as 
follows. For each vertex, add extra edges coloured with all the symbols not already 
colouring the edges incident with the vertex. The other end of each additional edge 
added is a new vertex of valency 1 and is called a boundary vertex. Each original 
vertex has valency m in the completed tree and is referred to as an interior vertex. 

Next, map the completed tree to the plane so that the edges around each vertex 
are coloured with the symbols Si, S2, ■ ■ ■ , S m in clockwise order, with all edges of 
unit length, and so that the edges around any given vertex are equally spaced, 
with an angle of 2ir/m between successive edges. Note that vertices and edges may 
overlap. It is clear that any two such maps will be the same up to a rotation. 

Then, each interior vertex determines a regular m-gon in the plane, with the mid- 
points of the edges in the polygon given by the mid-points of the m edges adjacent 
to the given vertex in the original tree. The edges in the polygon are given the same 
colours as the edges in the completed tree that they cross. The collection of regular 
m-gons obtained (which may also overlap) forms an m-cluster in the sense of [16] 
(note that this is not the more modern notion of cluster introduced by Fomin- 
Zelevinsky [I2])- Then, as in [16, Sect. 7] there is an (orientation-preserving) 
homeomorphism of the plane taking the m-cluster to a (diagonal-coloured) m- 
angulation of P. The dissection obtained is well-defined up to a rotational symmetry 
of P (by the above remark concerning the map of the completed tree into the plane). 

It is clear that this map is the inverse of the map above so the result is proved. □ 

We remark that the completion of trees considered here is essentially (a general- 
isation of) the standard completion of binary trees (i.e. a bijection from (c) to (d) 
in [31 Ex. 6.19]). 

Corollary 3.7. There is a bijection between the following sets: 

(a) The set of labelled m-edge-coloured trees of relations with k vertices and 
circular order (k k — 1 • • ■ 1). 

(b) The set of rooted diagonal- coloured m-angulations of an (to — 2)k + 2-sided 
regular polygon up to rotation. 

Proof. We use the above bijection. Given a labelled TO-edge-coloured tree with 
k vertices, forget the vertex-labelling to obtain an m-edge-coloured tree with k 
vertices, and take the corresponding diagonal-coloured m-angulation together with 
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Figure 6. 4-angulation of a regular 22-sided polygon as in Corol- 
lary EL7|Jb) corresponding to the labelled 4-edge-coloured tree with 
10 vertices in Figure HJc) via the bijection in Corollary 13.71 The 
shaded region is the root m-gon. 



the distinguished m-sided polygon corresponding to the vertex in the original tree 
labelled k. 

Conversely, given a rooted diagonal-coloured m-angulation of an (m — 2)k + 2- 
sided regular polygon (up to rotation), ignore the root and take the corresponding 
m-edge-coloured tree with k vertices, S. The root gives a distinguished vertex of 
the tree, which we label k. Then label the vertex <Jg{k) with k — i, for i = 1, 2, . . . , k. 
It is clear that this numbering gives a labelling of the m-edge-coloured tree giving 
it circular order (k k — 1 • • • 1). □ 

See Figure[6]for the rooted diagonal-coloured 4-angulation of an (4 — 2) • 10 + 2 = 
22-sided regular polygon (up to rotation) corresponding to the labelled 4-edge- 
coloured tree with 10 vertices and circular order (10 9 • ■ ■ 1) in Figure Hfc). 

Corollary 3.8. The cardinality of each of the following sets: 

(a) The set of labelled m-edge-coloured trees with k vertices and circular order 
(kk-1 ■■■ 1); 

(b) The set of connected noncrossing RNA m-diagrams of degree k; 
is given by 

m /(m-l)fc\ 
k ' m (m-2)k + 2\ k-l )' 

Proof. Both sets have the same cardinality by Corollary 13.41 and, by Corollary 13.71 
they have the same cardinality as the set of rooted diagonal-coloured m-angulations 
of an (m — 2)fc + 2-sided regular polygon up to rotation. The number Sk, m of such 
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to- angulations without a root, with no labelling of diagonals and ignoring rotational 
equivalence is well-known (see e.g. [II])- Let C™ be the fcth Fuss-Catalan number 
of degree to: 



Cm 
h — T" 



mk 
k - 1 



1 



(m - l)k + 1 



mk 
k 



Then 



S, 



1 



k , rn 



jnm-1 _ 

k ~ (m - 2)k + 1 



(to — l)fc 
k 



Since there are k m-sided polygons in an m-angulation, there are k possibilities 
for the root. There are to possibilities for a labelling (once one diagonal is coloured, 
all other diagonals in the TO-angulation have determined colours using the rule 
that each m-gon must have its edges coloured Si, S 2 , ■ ■ ■ , S m clockwise around the 
boundary. Each orbit of diagonal-coloured rooted dissections under the action of 
the rotation group of the polygon contains (to — 2)k + 2 elements (the number of 
sides of P). Hence, we have: 

kuiSk.m 



(m - 2)k + 2 



and the result follows. 

Example 3.9. For to 

following table: 



□ 



3,4,5,6, the first few values of Tfc )OT are given in the 



k 





1 


2 


3 


4 


5 


6 


Tk,3 


1 


1 


3 


9 


28 


90 


297 


T M 


1 


1 


4 


18 


88 


455 


2448 


Tk,5 


1 


1 


5 


30 


200 


1425 


10626 


Tk,6 


1 


1 


6 


45 


380 


3450 


32886 



For to = 3, we have that: 



T, 



where C(r) = 
Prop. 7.5 



fe,3 - 

2r 
r 



2k 



fc + 2 V fc - 1 



C(k + l)-C(k), 



and is sequence A071724 in [30 

4 

TkA — 



is the rth Catalan number. This case is covered in 
For to = 4, we have: 



2fc 



3fc 
fc- 1 



which is sequence A006629 in [30]. The cases to = 5,6 do not appear in [50] . 

Remark 3.10. Let ir e Gk- In l3.Hl3~81 we could consider noncrossing RNA m- 
diagrams on vertices 7r(l), 7t(2), . . . , 7r(fc) (clockwise around the circle). Then the 
results [3TTII3.8l hold. with the circular order (fc fc — 1 • • • 1) replaced with the circular 
order a — (7r(fc) 7r(fc — 1) • • • tt(1)) satisfying a(w(i)) = n(i — 1). 
In particular, parts (a) and (b) of Theorem [XT] become: 

(a) If 7r(ai), 7r(a2) G Gi and 7r(bi), ^(62) G Gj for i ^ j, then we cannot have 
<Xi > b\ > a 2 > b 2 - 

(b) If 7r(a) G Gi for some i, then aci^ia)) is the vertex n(a') with a' less than 
a maximal such that 7r(a') £ Gi (or, if no such vertex exists, it is 7r(a') 
where a' is maximal such that 7r(a') lies in Gi). 

Furthermore, the number of connected labelled TO-edge-coloured trees with fc ver- 
tices and fixed circular order a given by a fc-cycle is given by the formula for Tk, m 
in Corollary [ 
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4. Link with Fuss-Catalan combinatorics 

In this section, we show that by restricting degree k noncrossing RNA m- 
diagrams appropriately, we get a bijection with the set (m — 2)-clusters of type 
Ak-i (in the sense of [TU]) or the set of m-angulations of an (to — 2)k + 2-sided reg- 
ular polygon. Since the number of either of these is known (a Fuss-Catalan number), 
this, together with an appropriate bijection, gives us an alternative way of counting 
the total number of RNA m-diagrams, i.e. an alternative proof of Corollary 13.81 

We also consider here m-angulations of a fixed polygon, not up to rotation. 
Note that such an TO-angulation can be considered as an m- angulation of a polygon 
together with a marked edge. The marked edge (which can be considered the base of 
the polygon) indicates how the TO-angulation can be rotated onto the fixed polygon 
we are interested in. 

We now state the result giving the bijections, emphasizing that the bijection 
between the sets in parts (3) and (4) is essentially a variation of Theorem 3.6 
(which uses [3H Sect. 6.2]), and that the bijection between (4) and (7) is in [TU1 
Sect. 5.1]. The set in (5) is a staging post, and the sets in (6) and (7) are included 
for interest. Thus the main content is linking the sets in (1) and (2) with each other 
and with the other sets listed. 

Theorem 4.1. The following sets are all in bijection. 

(1) The set of degree k+1 connected noncrossing RNA m-diagrams with (k+l)st 
vertex containing S± only; 

(2) The set of labelled m- edge- coloured trees G with k + 1 vertices and circular 
order (k+1 k ■ ■ ■ 1), with HI a vertex of valency 1 and the edge adjacent 
to it coloured Si; 

(3) The set of rooted m- edge- coloured trees with k+1 vertices and only one 
edge, coloured Si, adjacent to the root; 

(4) The set of m-angulations of a fixed (to — 2)k + 2-sided regular polygon; 

(5) The set of rooted m- edge- coloured trees with k vertices and no edge coloured 
Si adjacent to the root; 

(6) The set of rooted complete (to — l)-plane trees with k internal vertices; 

(7) The set of (to — 2) -clusters (in the sense of |10j ) of type Ak~i, i.e. maximal 
simplices of the (to — 2)-cluster complex of type Ak-i as considered in [10] - 

Proof. (l)<s->-(2): This is a restriction of the bijection in Remark 13.31 The fact that 
vertex k + 1 contains only the symbol Si corresponds to the fact that vertex k + 1 
in the tree has only one edge incident with it, coloured Si. 

(2) o(3): To go from an object in (2) to an object in (3), take vertex k + 1 as 
the root and forget the vertex-labelling. For the inverse map, the condition on the 
circular order, ac, in (2) determines the labelling, starting from k + 1 corresponding 
to the root. 

(3) o(4): Restricting the bijection in Theorem 13.61 gives a bijection between the 
set of objects in (3) and the set of rooted diagonal-coloured m-angulations of an 
(to — 2) (k + 1) + 2-sided regular polygon up to rotation such that the root m-gon 
has only one edge incident with another m-gon and that edge is coloured Si. 

Given such a rooted diagonal-coloured m-angulation of an (m — 2)(fc + 1) + 2- 
sided regular polygon Ai up to rotation, we remove the root m-gon to give (via an 
orientation-preserving homeomorphism) a rooted diagonal-coloured m-angulation 
of an (to — 2) k + 2-sided regular polygon. We mark the edge (which is coloured Si) 
and forget the diagonal labelling to obtain an m-angulation of a fixed (m — 2)k + 2- 
sided regular polygon, i.e. an object as in (4). Note that, because of the marked 
edge, the m-angulation we obtain is independent of the choice of M up to rotation. 
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Figure 7. The case k — to = 3 of the bijection between (3) and 
(4) in Theorem liTTl 



To go back, given an m-angulation M. of a fixed (to — 2)k + 2-sided regular 
polygon, we add a new m-gon M onto the boundary of M 1 with one boundary 
edge incident with the marked edge of M . This, via an orientation-preserving 
homeomorphism, gives an TO-angulation of an (to — 2) (k + 1) + 2-sided regular poly- 
gon, which we consider up to rotation. We colour the marked edge of M' with Si. 
This uniquely determines a diagonal-colouring of the rest of the m-angulation (since 
each m-gon must be have its boundary edges coloured Si, Sa, ■■■ , S m clockwise, by 
definition). It is clear that this gives an inverse to the above map. 

Figure [7] gives an explicit bijection between the set of objects in (3) and the set 
of objects in (4) in the case k = m = 3. We always draw the triangulation of the 
pentagon with the marked edge at the base. 

(3) o(5): Choose an object in (3), and let e be the edge coloured Si, incident 
with the root. Let the new root be the other vertex incident with e, then delete 
the old root and the edge e. We obtain a new rooted m-edge-coloured tree with 
k vertices and Si not colouring any edge incident with the new root. To go back, 
given an object in (5), add a new edge incident with the root and colour it Si. The 
other end is a new vertex which becomes the new root. 

(5)o(6): Given an object in (5), first complete it as in the second paragraph of 
Theorem 13.61 Embed it in plane using the cyclical order Si, S2, . . . , S m , Si of the 
edges around each vertex. Keep the root unchanged, and delete the edge incident 
with the root in the completed tree which is coloured with Si. Remove all the 
colours on the edges. We obtain in this way a rooted completed plane (to — l)-tree. 

To go back: given an object in (6), let the parental edge of a vertex be the edge 
between it and its parent. Regard the root as having a (non-existent) parental edge 
labelled Si for these purposes. Then, starting with the root and working inductively, 
given a vertex with parental edge coloured Sj , colour the edges linking it to its chil- 
dren clockwise from the parental edge in the order: Sj+i , Sj+2 , • ■ • , S m , Si , . . . , Sj—i. 
Finally, remove all the leaves. 

(4) o(7): See QH Sect. 5.1]. □ 

Figure [5] gives examples of objects corresponding to each other under the above 
bijections. For (7), we use the numbering of the vertices of the dodecagon anticlock- 
wise, Pi, P 2 , . . . , P12, from the root edge at the bottom of part (4) of the figure. 
Following [THl Sect. 5.1], we see that the 2-snake consists of diagonals linking pairs 
of vertices (Pi,Pio), (Pio^Ps), (P3,Ps) an( A (PsjPs), corresponding to the negative 
simple roots — a\, —ot\, —ot\ and —a\, respectively. This allows the corresponding 

2-cluster to be read off the 4-angulation as in [TU1 Sect. 5.1]. Here a« + \- otj is 

denoted by ay. 
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Si 5*2 S3 Si 




s 3 3 



Si 



s. 



Sa 



So 



Si 



8.1: A degree 6 connected noncrossing RNA 
m-diagram with 6th vertex containing Si 
only. 



Si 



5, 



Sa 



S 2 



8.2: A labelled 4-edge-coloured tree with 
6 vertices and circular order (6 5 4 3 2 1) 
with 6 a vertex of valency 1 connected to 
an edge with symbol Si. 



Si 

o 

8.3: A rooted 4-edge-coloured tree with 6 
vertices and only one edge, coloured Si, 
adjacent to the root. 



s. 




8.4: An m-angulation of an 
(m — 2)k + 2 = 12-sided regu- 
lar polygon (the marked edge 
is the one at the bottom of 
the diagram). 



Si 

o 

8.5: A rooted 4-edge-colourcd 
tree with 5 vertices and no 
edge coloured Si adjacent to 
the root. 




8.6: A rooted complete 
ternary plane tree with 5 
internal vertices. 



"14) "4) "34 
8.7: A 2-cluster of 
type Ai 



Figure 8. Objects corresponding to each other under the bijec- 
tions in Theorem 14. II 
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As previously remarked, the number of objects in (4) is known (see e.g. [17]) and 
is given by the Fuss-Catalan number C™ _1 . So we have: 

Theorem 4.2. The number of objects in (1) and (2) is 

1 / (to - l)k \ 

(m-2)fc + l V k ) ' 

The number of objects in (2) for the case to = 3 is counted in [3J Prop. 7.5]. 

Example 4.3. For to = 3, 4, 5, 6, the values of Sk, m for small k are given in the 
following table for convenience: 



k 





1 


2 


3 


4 


5 


6 


Sk,3 


1 


1 


2 


5 


14 


42 


132 


SkA 


1 


1 


3 


12 


55 


273 


1428 


Sk,5 


1 


1 


4 


22 


140 


969 


7084 


Sk,6 


1 


1 


5 


35 


285 


2530 


23751 



See [30], sequences A000108, A120588. The case m = 4 is sequence A001764, the 
case to — 5 is sequence A002293 and the case m = 6 is A002294. 

We remark that different sequences generalising the Catalan numbers appear in 
the enumeration of RNA m-diagrams of various kinds in [33] . See also [8] . Catalan 
numbers also appear in the enumeration of fc-noncrossing RNA secondary structures 
in [21]. 

In [351 Thm. 2.1], a recursion formula for the number of RNA secondary struc- 
tures (i.e. diagrams in the sense of [25J Chap. 2]) is given. The main technique is to 
partition the diagrams on n points according to whether (a) the point n at one end 
of the diagram is (i) incident with an arc or (ii) not incident, and then, in case (ii), 
according to (b) the other end point of the arc. We now use a similar technique to 
get a recursion-type formula for connected noncrossing RNA m-diagrams of degree 
k. 

This will give a quadratic formula for Tk t , n (the number of connected noncrossing 
RNA m-diagrams of degree k, or the number of labelled m-edge-coloured trees with 
k vertices and circular order (k k — 1 • • • 1)) in terms of the S v ^ m (the number of 
noncrossing RNA m-diagrams of degree v + 1 with (v + l)st vertex containing S\ 
only, or the number of labelled m-edge-coloured trees with v + 1 vertices, circular 
order (v + 1 v ■ ■ • 1) and v + 1 a vertex of valency 1 and edge incident to it coloured 
Si) with v varying. This allows us to give an alternative proof of the formula for 
T fe>m (Corollary EES). 

Lemma 4.4. There is a bijection between the set of connected noncrossing RNA 
m-diagrams of degree k and the disjoint union of 

(a) The set of connected noncrossing RNA m-diagrams of degree k + 1 with 
vertex k + 1 having symbol Si only and 

(b) The disjoint union (for v = 2, .. . , k) of the sets of pairs with the first con- 
sisting of a connected noncrossing RNA m-diagram of degree v with vertex 
v having symbol Si only and the second element consisting of a connected 
noncrossing RNA m-diagram with degree k — v + 2, vertex k — v + 2 having 
symbol Si only. 

Proof. Partition the set of connected noncrossing RNA 771-diagrams S of degree k 
by 

(a) whether there is an arc in X incident with Si in vertex 1 and 

(b) if there is such an arc, by the vertex v, 2 < v < fc, of its other end. 



c _ nm-i _ 1 ( (to - l)fc \ 

Sk,rn -C k - - I I 
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k+1 




3 



Figure 9. The case with no arc incident with Si in vertex 1. 

The set of structures with no such arc is in bijection with the set of connected 
noncrossing RNA m-diagrams of degree k + 1 with vertex k + 1 having symbol Si 
only (the bijection adds a new vertex, k + 1 with symbol Si only and an arc from 
Si in X, vertex 1, to vertex k + 1). See Figure 

Given a structure £ with an arc incident with Si in vertex 1, we obtain a 
connected noncrossing RNA m-diagram of degree v with vertex v having 5*1 only 
as a symbol, by restricting to vertices 1, — 1 and symbol Si of vertex v. 

We obtain a connected noncrossing RNA m-diagram of degree k — v + 2 with 
only symbol Si in degree k — v + 2 by considering the remainder of S not lying 
in the restriction above, adding the symbol Si to the vertex v, renumbering the 
vertices 1, . . . , k + 1 — v and then following the same procedure as in the case when 
there is no arc incident with Si in vertex 1 . See Figure 1101 

It is clear that this pair determines E, and the Lemma follows. □ 

Corollary 4.5. The number of connected noncrossing RNA m-diagrams of degree 
k on m symbols is given by 

k k 

Ti \ pm-l/-im-l \ * c c 

fc,m / v — 1 k+1 — v — / j ^v — l,m^fc+l — v,m- 

v—1 v—1 

Proof. This follows immediately from Lemma 14.41 with the first term (v = 1) 
corresponding to the case where there is no arc incident with symbol Si in vertex 
1 of the connected noncrossing RNA m-diagram structure of degree k. □ 

Lemma 4.6. [HI Eq. 5.63] 
Let n, r,s,t 6 with n > 0. Then we have: 

Er s ( tk + r\( t(n — k) + s \ r + s ( tn + r + s \ 

, Q tk + r t(n - k) + s \ k J \ n~k ) ~ tn + r + s\ n J 

(If a denominator factor vanishes, the formula still makes sense by cancelling it 
with an appropriate factor in the numerator of a binomial coefficient.) □ 

Note that the formula holds in greater generality but we shall not need it. We 
now have an alternative proof of Corollary 13.81 
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V 1 



5„ 

5*2 

Si 



k S m S 1S2 l 



Sn 



Si 

s 2 



S -n 



S-2 



Si 



1 Si Sn 



V 1 



k+ln^k v + 2 



J S2 



Si Si a v i-> 1 



5i 



Si 
S 2 

v + 1 h-> 2 



Sn 



Figure 10. The case with an arc incident with Si in vertex 1. 



Theorem 4.7. Fix integers m > l,k > 0. The cardinality of the set of connected 
noncrossing RNA m- diagrams of degree k is given by 



(m-2)fc + 2 V k-1 



(m — l)k 



Proof. By Corollary 14. 5 1 we have that 



k-l 



Tfc 5 77i — ^ S v ^vaSk — v ,m ^ ^ Sv^iSk—v^rn S^^rnt 
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since So, m = !• Hence, using Lemma 14.61 we have: 



A 



Tk,m — ^ 



(m-l)v + l\ 1 /(m - l)(fc - v) + 1 



— ' (to — 1)« + 1 \ u I (m — — v) + 1 \ fc — w 

^ — \ / \ / \ / \ / \ 

~ Sk,m 

2 /(m-l)fc + 2\ 1 /(m-l)fc + l 



(m-l)fc + 2\ fc / (m-l)fc + l\ k 

2 (to- l)k + 2 /(m- l)k + l\ 1 /(to - l)fc + 1 

(m - l)fc + 2 ' (to - 2)fc + 2 V fc / ~ (to - l)fc + 1 V k 

2 /(m-l)fc+l\ 1 /(m-l)fc + l 



(ro-2)fc + 2\ fc / (m-l)fc + l\ fc 
2 1 \ /(m - + T 



(m-2)fc + 2 + jfe 

2 1 \ (to - l)k + 1 /(m - l)k 



(m-2)fc + 2 (m-l)fe + iy fe V k ~ l 

((to - l)/s + 1)(2((to - l)k + 1) - ((to - 2)k + 2)) /(m - l)k 



k((m-2)k + 2)((m-l)k+l) \ k-1 

to / (m — l)fc^ 
~ (m ~2)k + 2 V jfe - 1 
as required. □ 

5. Convolution 

In this section we we use a similar technique to show that the sequence Tfc jTn , 
k = 1,2,... can be regarded as an TO-fold convolution of the sequence Sk,mi k — 
0, 1,2,.. .. We fix a symbol S r in the following. 

Lemma 5.1. The following sets are in bijection: 

The set of connected noncrossing RNA m-diagrams of degree k + 1 in which there 
is an arc between vertex 1 and vertex k + 1 with symbol S r . 

X 

The set of connected noncrossing RNA m-diagrams of degree k + 1 in which there 
is an arc between vertex 1 and vertex k + 1 with symbol Si . 

Proof. Given a structure in the first set, note that there can be no arcs incident 
with vertex 1 with symbol St with t < r and no arcs with vertex k + 1 with symbol 
St with t > r (since the structure is noncrossing). Similarly, given a structure 
in the second set, there can be no arcs incident with vertex k + 1 with symbol 
St with t > 1. It follows that moving an element E of the first set r — 1 steps 
anticlockwise, each step moving each arc one symbol anticlockwise in the diagram, 
gives a noncrossing RNA TO-diagram £' of degree k + 1 in which there is an arc 
between vertex 1 and vertex k + 1 with symbol Si . Since £' has the same number 
of arcs, it must be connected by Remark 13. 3f 1). Hence £' is a structure in the 
second set and we get a map from the first set to the second set. It is clear that 
the inverse of this map is shifting r — 1 steps clockwise. □ 

We note that removing symbols S%, S3, . . . , S m from vertex k+1 from an element 
of the second set makes no difference, since vertex k + 1 can only have an arc to 
vertex 1 with symbol Si by the noncrossing condition. 

Corollary 5.2. The number of connected noncrossing RNA m-diagrams of degree 
k + 1 on m symbols in which there is an arc between vertex 1 and vertex k + 1 with 
symbol S r is equal to Sk m - 
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Proof. Use Lemma RTTl and Theorem l4.il noting that an object as in Theorem l4.1f a) 
must have an arc between vertex k + 1 and vertex 1 with symbol Si by the non- 
crossing and connectedness conditions. □ 

Proposition 5.3. Let >Sfe irn ,Tfe )TO be as above and set k > 1. Then 

fei>0,...,fe m >0,fciH hfe m =fc-l 

i.e. the sequence T\, m , T 2 . m , ■ ■ ■ is the m-fold convolution of the sequence So !Tn , 

<Sl,m, 

Proof. Let E be a connected noncrossing RNA m-diagram with k vertices, with 
corresponding labelled m-edge-coloured tree G. Recall (see Remark I3.3[) that 
<tg(1) = k, i.e. applying the symbols S\, S2, ■ • • , S m in order to the vertex 1 takes 
us from 1 to k. Suppose that applying symbol Sj takes us from vertex i to vertex 
i + kj for j = 1,2, ... ,m. Note that kj > for all j by the noncrossing condi- 
tion. We see that £ corresponds to the joining of m connected noncrossing RNA 
m-diagrams Si, ... , S m , where Hj has degree kj + 1, and in which there is an arc 
between vertex 1 and vertex kj + 1 in Sj with colour Sj . Note that we must have 
&! + •-• + k rn = k — 1. Thus we have a bijection between the set of connected non- 
crossing RNA m-diagrams of degree k and the set of such m-tuples. The formula 
in the proposition follows from this and Corollary 15.21 □ 

Remark 5.4. We saw in Theorem 14.21 that the sequence So, m , Si, m , . . . is the 
Fuss-Catalan sequence of degree m — 1. Thus we have shown that the sequence 
Ti, m , Ti,mi ■ ■ ■ is the m-fold convolution of the Fuss-Catalan sequence of degree 
m - 1. This fact appears in [201 §3]. See also [H] (sequences A071724 and A006629) 
for discussion of the cases m = 3 and m — A. 

6. The total number of trees of relations 

In Section [3] we counted the number of labelled m-edge-coloured trees with k 
vertices and a fixed circular order. We now would like to count the total number 
of labelled m-edge-coloured trees with k vertices, i.e. with arbitrary circular order. 
By Remark 13.101 the number of labelled m-edge-coloured trees with k vertices and 
fixed circular order a (where a is a fc-cycle) is given by 

m / (m-l)fc\ 

k ' m ~ {m-2)k + 2\ fc-1 ) 

We also have: 

Lemma 6.1. Let G be a labelled m-edge-coloured tree with k vertices. Then the 
circular order oq of G is a k-cycle. 

Proof. This is clearly true if = 1 or 2. Suppose it holds for smaller values 
of k. Let v be a vertex of G which is not a leaf. Suppose that v is incident 
with edges ei,e2, . . . ,e<j in G, coloured with S ri , S r2 , . . . , S rd respectively, where 
fi < 7"2 < • • • < r&. Let the end-points of these edges (other than v) be v%, i>2, • • • , Vd- 
Removing v from G leaves d subtrees G[, G' 2 , . . . , G' d incident with vi, ua, • • • , «d 
respectively. Let Gi be the subtree G\ with v and reattached to Vi. 

By the inductive hypothesis the are all cycles. Hence, repeatedly applying 
(TGi to v cycles through the vertices of G\. Since gq — o'd on all vertices of G\ 
except w = (Tq i (v), repeatedly applying a a also cycles through all the vertices of 
G\. Since r 2 is minimal such that S r2 is a symbol colouring an edge incident with 
v with r2 > r\, og(w) will lie in G' 2 . In fact o~g{w) = uq 2 {v), since in G2, v is not 
incident with any edge with symbol other than S T2 . Repeatedly applying uq then 
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cycles through the vertices of G' 2 before coming back to a G }(v). Repeating this 
argument, we see that repeatedly applying uq to v first cycles through G[, then 
through G'%, G' 3 , . . . , G' d in order before eventually returning to v. □ 

(Note that it follows that the circular order of an m-edge-coloured tree is also 
a cycle.) We thus see that the number of possible circular orders of labelled m- 
edge-coloured trees with k vertices is the number of fc-cycles in i.e. (k — 1)!. It 
follows that: 

Proposition 6.2. The total number of labelled m-edge-coloured trees of relations 
with k vertices is: 

m((m — l)fc)! 
' = ((m-2)fe + 2)! 

Proof. By the above discussion and Lemma 16.11 we have: 
Uk, m = T k , m {k - 1)! 

= m ((m-l)fc)! _ 

((m-2)fc + 2) (fc-l)!((m-2)fe + l)P ; ' 
m((m — l)fc)! 
= ((m- 2)fc + 2)!' 



□ 



Remark 6.3. This is already known, and appears in j32, 5.28, pl24], credited to 
I. Gessel. Stanley asked whether there was a simple bijective proof of this fact. A 
bijective proof was subsequently given in [6]. We have thus obtained here a new 
bijective proof of this fact. Note that Uk, m can be rewritten as: 

m(fc-2)!(((m-l)fc)!) __ fu ^({m-l)k^ 

k 



= m{k - 2)! 



{{m-l)k-{k-2))\{k-2)\ "~ v ~ '\ k-2 
Example 6.4. For m = 3, 4, 5, 6, we give below some values of Uk m for small k. 



k 


1 


2 


3 


4 


5 


6 


Uk,3 


1 


3 


18 


168 


2160 


35640 


Uk,4 


1 


4 


36 


528 


10920 


293760 


Uk,5 


1 


5 


60 


1200 


34200 


1275120 


U k , 6 


1 


6 


90 


2280 


82800 


3946320 



Putting together the results of the previous sections with the above discussion, 
we have: 

Corollary 6.5. There are bisections between the following sets: 

(1) The set of pairs consisting of a rooted diagonal- coloured m-angulation of 
degree k up to rotation and a k-cycle; 

(2) The set of (m-gon) -labelled, diagonal- coloured m- angulations with k m-gons 
up to rotation; 

(3) The set of labelled m-edge-coloured trees with k vertices. 
All of the above sets have cardinality: 

_ m(((m - l)fc)!) 
Uk ' m ~ (( m -2)fe + 2)f 

Proof. The set of labelled m-edge-coloured trees with k vertices and a fixed circular 
order is in bijection with the set of rooted diagonal-coloured m-angulations of degree 
k up to rotation by Corollary 13.71 (see also Remark 13. 10[) . Thus mapping such a 
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tree to its corresponding m-angulation together with the circular order of the tree 
gives a bijection between (3) and (1), using Lemma IrTTl To go between (2) and (3), 
argue as in Theorem 13.61 (the m-gon-labelling of the m-angulation corresponds to 
the vertex-labelling of the tree) . □ 

7. Generalised Induction 

In this section we give the definition of our generalised induction on labelled 
m-edge-coloured trees with k vertices, generalising the induction in the case m = 3 
defined in [5] . The induction in [5] leads in [5] to the construction of new languages 
generalising the Sturmian languages, used to study fc-interval exchange transfor- 
mations. Induction generates new labelled m-edge-coloured trees with the same 
number of vertices starting with a given such tree, and the transitive closure is an 
equivalence relation. We show that the circular order is an invariant, giving rise to 
a classification of the equivalence classes by fc-cycles in &k- 

Given an labelled m-edge-coloured tree G and integers i, j £ {1, . . . , m} we define 
a maximal Si — Sj chain B in G to be a (linear) subtree of G whose edges are only 
coloured Si and Sj such that no other edges incident to B are coloured by Si or Sj . 

Definition 7.1. Let G be a labelled m-edge-coloured tree with k vertices. Fix 
i,j £ {1, . . . , m} with i < j. Let B be a maximal Si — Sj chain in G. Define 
Rf.{G) to be the labelled m-edge-coloured tree with k vertices obtained from G by 

• first removing all subtrees in the complement of the maximal chain B 

• interchanging the vertices of each edge of B coloured by Sj 

• interchanging the symbols Si and Sj on the whole maximal chain B 

• reattaching the previously removed subtrees to B at the vertices with the 
same label they were removed from. 

Similarly, define LfJG), where in the second bullet point in the above definition 
we interchange the vertices of each edge coloured by Si rather than those coloured 
by Sj. We also set Rf :— R^ i+ i and Lf :— Lf i+l and we will write Ri and Li if 
B is clear from the context. 

Lemma 7.2. Let i,j £ {1, . . . , m} with i < j and B be a maximal Si — Sj-chain 
with no incident edges coloured by Si+i, . . . , Sj—j.. Then the induction Rf^(G) is a 
product of inductions of the form Ri for I = i, i + 1, . . . , j — 1. 



Proof. Suppose first that B has the following form: 

Si Sj Si Si 

a\ et2 03 et4 • • • a, — i a r . 

Then Rfj(G) is the chain 

Sj Si Sj Si Sj 

B = ai &3 »2 Q-5 o-i ■ ■ ■ a r -2 a r . 

On the other hand, applying Ri, R4+1, . . ., Rj-2, in order (in each case to 
all the maximal chains of appropriate type contained in B), we obtain the maxi- 
mal Sj-i — 5,-chain ai — 0,2 — - — a 3 — — — 04 • • • a r _i 3 a r . 



Next, apply Rj-i to the whole chain B. Then, applying Rj-2, Rj—3t ■ ■ •> Ri m de- 
creasing order (in each case to all the maximal chains of appropriate type contained 
in B) gives the chain B' . 

The proof works in a similar way for the other configurations of maximal Si — Sj- 
chains, i.e. the chains of the form 

Si S 3 St Sj 

ai et2 03 04 ■ ■ • a, — 1 a r 

and 

Sj Si Sj Sj 

a\ 02 0,3 04 ■ • • a r -i a r . 
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□ 

Remark 7.3. (1) In the above proof, we may replace the inductions Ri with 
inductions Li for I = i, i + 1, . . . , j — 1. 

(2) If we replace Rj-i with Lj_i in the above proof we obtain the induction 
Lfj{G) instead. 

(3) Induction can also be defined on m-edge-coloured trees: For an m-edge- 
coloured tree with k vertices, choose an arbitrary vertex-labelling, apply induction, 
and then remove the vertex labelling. It is clear that this is independent of the 
vertex-labelling chosen. 

(4) The inductions Rfj and Lfj are mutually inverse maps. 

Definition 7.4. Call two labelled m-edge-coloured trees with k vertices G, G' in- 
duction equivalent if there is a sequence of inductions taking G to G', either of the 
form Li, for 2 < i < m — 1 or of the form Ri, for 1 < i < m — 2. Clearly this is a 
reflexive relation. It is symmetric since Li and Ri are inverse maps and it is easy 
to see that it is transitive. Hence it is an equivalence relation. We write r(G) for 
the equivalence class containing G. 

The inductions Ri and Li and the above notion of induction equivalence are 
the ones that behave well with respect to the circular order, as we shall see below. 
However, we shall also denote by r ffen (G) the equivalence class of G under the 
equivalence given by the more general inductions Rij and Lij. 

Proposition 7.5. Let G be a labelled m-edge-coloured tree with k vertices. Then 
there exists a labelled m-edge-coloured tree with k vertices whose edges are only 
coloured by S± and S m , and a sequence of inductions, each of the form Rij (with 
j < m — 1) or of the form Li j (with i > 2) taking G to G*. 

Proof. We first show that there exists a labelled m-edge-coloured tree G2 with k 
vertices that is induction equivalent to G, none of whose edges is coloured with S2 , 
by removing the symbols S2 one by one. Firstly, remove all edges coloured with 
symbols S4, ... , S m and call the resulting labelled 3-edge-coloured tree G. By [2J 
Prop. 5.2] there exists a sequence of inductions of the form R\$ and £2.3 taking G to 
a labelled 3-edge-coloured tree G2 with no edge coloured S2 ■ Let G2 be the labelled 
m-edge-coloured tree G2 with the detached edges reattached (to the vertices with 
the same label). Since none of the detached edges are coloured with Si, S2, or S3 
this sequence of inductions also takes G to G2 by identifying maximal chains in G 
with corresponding maximal chains in G. 

Suppose we have shown that G is induction equivalent to G;_i, where G;_i has 
no edges coloured S2, • ■ ■ , S1-1. 

Then start by detaching all edges coloured S1+2, ■ ■ ■ , S m . Call the resulting la- 
belled m-edge-coloured tree G. By [2 Prop. 5.2] there is a sequence of inductions of 
the form Rij and L^i +1 taking G to Gi, where G; has no edges coloured S2, . . . , Si- 
Reattach the detached edges and call the resulting labelled m-edge-coloured tree 
G;. Since none of the reattached edges are coloured by Si, Si, or S1+1, this se- 
quence of inductions takes Gi-i to G; by identifying the maximal chains in G with 
the maximal chains in G;_i. Note that none of the symbols S2,-..,Si appears 
in Gi. Hence, by induction on I, we can construct G TO _i = G» with no symbols 
5*2, . . • , S m -i and a sequence of inductions, each of form R^ (with j < m — 1) or 
L^ (with i > 2) taking G to G». □ 

Remark 7.6. In the above proof, the inductions R\ \ are applied in a situation 
satisfying the hypotheses of Lemma l7.21 and hence each can be written as a product 
of the inductions Ri, . . . , Ri-\. 
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Corollary 7.7. Let G be a labelled m- edge- coloured tree with k vertices. Then there 
exists a labelled m- edge- coloured tree with k vertices, G„, containing only symbols 
Si and S m and a sequence of inductions, each of the form Ri or Li, taking G to 
G* . 

Corollary 7.8. Let G be a labelled m- edge- coloured tree with k vertices. Then every 
possible m- edge- coloured tree with k vertices appears as the underlying unlabelled m- 
edge-coloured tree of a labelled m-edge-coloured tree in T(G). 

Proof. Let S and S 1 be arbitrary m-edge-coloured trees with k vertices. Then, by 
Corollary 17. 71 S is induction equivalent to an m-edge-coloured tree with k vertices 
containing only the symbols Si and S m ; similarly for S' (see Remark [773^3)) . If k 
is odd there is only one such tree: 

S m Si S m Si 

• • • • • • • • • 

and it follows that S and S' are inductively equivalent. If k is even, there are two 
such trees: 

Sm Si Sm S m 

S m : • • • • • • • • • 



~ ^1 ^1 *->l 

6i : • • • • • • • • • . 

Then Rf r ^ h (S m ) = Si, so S m is induction equivalent to Si by Lemma ITT21 It follows 
that S and S' are induction equivalent in this case also. 

Given a labelled m-edge-coloured tree with k vertices, G, whose underlying un- 
labelled tree is <S, and an arbitrary m-edge-coloured tree with k vertices, S' , the 
above shows that S and S' are inductively equivalent. It follows that G and a 
vertex-labelling of S' are inductively equivalent, and the result follows. □ 

Remark 7.9. It follows from the above proof that every possible m-edge-coloured 
tree with k vertices appears as the underlying unlabelled tree of a tree in Teen (G) . 

Corollary 7.10. If there is a sequence of general inductions of the form Rij and 
Lij between two m-edge-coloured trees with k vertices then they are induction equiv- 
alent (i.e. equivalent under Li and Ri induction). 

Proof. This follows immediately from Corollary 17.81 since in fact any two m-edge- 
coloured trees with k vertices are induction equivalent. □ 

We shall see below that the corresponding result does not hold for labelled trees 
(see Remark l7.15p . 

Lemma 7.11. (a) Let k be odd, let i ^ j and let G be a labelled m-edge-coloured 
tree with k vertices and whose edges are coloured with Si and Sj only. Let S be 
the underlying unlabelled tree of G. Applying Rij induction on G has order k, 
producing k distinct trees with shape S. 

(b) Let k be even, i ^ j and let G be a labelled m-edge-coloured tree with k 
vertices and whose edges are coloured with Si and Sj only. Let S be the underlying 
unlabelled tree of G. Applying Rij induction to G has order k, producing k/2 
distinct labelled trees whose underlying unlabelled tree is S in each case and k/2 
distinct labelled trees whose underlying unlabelled tree is Rij(S). 

Proof, (a) Since S contains only the symbols Si and Sj, it is a line. Suppose the 
line is drawn horizontally and suppose the leftmost edge has colour Sj and vertices 
ai and a-i from left to right. Then in Rfj(G), if it is drawn with orientation given 

S- s ■ 

by • — 0,1 ■ (where one of the edges may not exist), the vertex ai is the 
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d th vertex from the left. It is clear that all the induced trees Rfj(G) have the same 
underlying unlabelled tree (those for d odd should be read from right to left). 

(b) The proof is similar to the one in (a), except that for d odd, the underlying 
unlabelled tree of {R hJ {G)) d is Ri,j(S). □ 

Remark 7.12. (1) Since in the context of Lemma \7. Ill the underlying unlabelled 
tree of G is a line, we can replace Rij in Lcmma [7.11l bv Ri . . . Rj_2Rj-iRj-2 ■ ■ ■ Ri- 
(2) If we replace the R induction in Lemma l7.11l bv L induction, the result holds 
and is proved by a similar argument. 

Lemma 7.13. Let G be a labelled m- edge- coloured tree with k vertices containing 
a maximal Si-Sj chain B with no incident edges coloured Si, i < I < j . Then the 
circular order, gq, is unchanged after Rij or Lij induction on B is applied. 

Proof. We show that the circular order is invariant under i?ij-induction in the 
context given (the proof for Ljj-induction follows a similar argument). 

Let G be a labelled m-edge-coloured tree with k vertices and circular order 
<tg and containing a maximal .Si-Sj -chain B such that no edges coloured Si, for 
i < I < j are incident to B. Let G' = Rfj(G) with circular order oqi. Let i < j 
and let a be a vertex in B. Let S[, . . . S' m denote the maps corresponding to the 
symbols S\,...,S m in the labelled m-edge-coloured tree G' . Then SjSiSj(a) = 
Sj(a), Sj(a) — S-(a) and, for I ^ j, Si (a) — S ; '(a). Similarly, after applying Ljj- 
induction, we have SiSjSi(a) — S'^a), Si (a) — S'Ja) and, for I ^ i, Si (a) — S[(a). 
There are three possible situations to consider. 

Case 1 : Suppose first that a has an incident edge coloured with Si with 1 < / < i. 
We assume that / is minimal. Let T be the subtree of G \ B connected to a via 
this edge. Then acia) lies in T. Applying the induction R?. to G results in an 
m-edge-coloured tree G' in which a is reconnected to T by the edge coloured with 
Sk- Therefore ac(a) = aG>(a). 

Case 2: Suppose that a is not incident with any edges coloured with S r for 
r < i, but that the vertex SjSi(a) has an incident edge coloured with the Si where 
j < I < m. We take / minimal. Let T be the subtree of G \ B connected to 
SjSi(a) via this edge. Then a a (a) lies on the subtree T. Applying Rf^{G) results 
in an m-edge-coloured tree G' such T is reconnected, via the edge coloured Sj, to 
S'jS'iia) = S j S l {a). Therefore a G {a) = a G ,(a). 

Case 3: Suppose that neither a nor SjSi{a) has any incident edges coloured Si, 
for I 7^ i, j. Then Si (a) = a for all I ^ i,j and Si(SjSi(a)) = SjSi(a) for all I ^ i,j. 
By our assumptions, for i < I < j, we have that S'i — Si on the whole maximal 
chain. Hence, using the relations from the beginning of the proof, 

o- G '{a) = S' m ---S[(a) 

— ' ' ' Sj+iSjS^a) 

— S'm ' ' ' S'j+lSjSi( a ) 

= SjSi(a) 

— S m ■ ■ ■ Sj+iSjSi(a) 
= S m ■ • ■ Si (a). 

□ 

Corollary 7.14. The circular order of a labelled m-edge-coloured tree with k ver- 
tices is invariant under Ri and Li induction. 



Proof. In Lemma 17.131 assume that j = i + 1. 



□ 
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s 2 



G Sx S 3 St 



S 



1 2 3 4 — 5 

Rl3 



G' 



S 3 Si S 3 Si 
1 3 2 — 5 



G 

S 2 



Figure 11. i?i3-induction does not preserve the circular order: 
ctg(3) = 6 while <tg'(3) = 5 

Remark 7.15. In general, the circular order of a labelled m-edge-coloured tree 
with k vertices is not invariant under general Rij and Lij induction for i < j + 1: 
in the above proof suppose that we have an edge e coloured Si, for i < I < j 
connected to Si (a) in the chain B of G and let T be a subtree of G connected to 
e. Then ac(a) lies on the subtree T. On the other hand in RfAG), the edge S, 



is connected to Si(a) = SlSjS^a) and thus <JG'( a ) ^ cr G( a ) hi general. For an 
example of this with i = 1 and j ' = 3 (and T just consisting of one vertex), see 
Figure [TT] 

Note that this means that it is not possible, in general, to write Rij as a composi- 
tion of inductions of form Ri and Li, despite Lemma FT2l (which says that sometimes 
this is possible). 

We finally obtain: 

Theorem 7.16. Let G and G' be labelled m-edge-coloured trees with k vertices. 
Then G' is in T(G) if and only if oq = o~qi . 

Proof. Suppose that G' is in T(G). It then follows directly from Lemma \1 . 141 that 
o~q = o~g' ■ Conversely, suppose that oq — o~qi . By Corollary 17.71 and the proof 
of Corollary 17.81 there exist labelled m-edge-coloured trees with k vertices, G*, in 
r(G) and G'„ in T(G') where the underlying unlabelled trees of G* and G'„ contain 
symbols Si and S m only. 

Suppose that k is odd and that G* and G'„ are as follows: 

„ St Sj s, Sj 

G* : ai 0,2 a 3 04 ■■• a r -i a k 



g: 



Si , s , s, , , s, 



Then uq, = (aia 2 a 4 . . . a k ^iakOk~2 ■ ■ ■ a 3 ) = (a'^a^ . . . a'k-i a 'k a 'k-2 ■ ■ ■ a 'i) = a G', ■ 
Because of its underlying unlabelled tree, G'„ is determined by al x and its circular 
order. Thus, given G* there are at most k possibilities for G* such that o~g, = o'G' 
holds. 

By Lemma 17.111 Remark l7.121 and Lemma [7.141 repeatedly applying either Rij- 
induction or ij^-induction to G» gives k distinct labelled m-edge-coloured trees H 
with k vertices satisfying ug, = &H- Hence any such that og, = o~g^ must be 
one of these trees H. Therefore G'„ is in T(G») and thus G' is in T(G). 

Suppose that k is even. In this case, there are at most k/2 distinct possibilities 
for G'„ because of the symmetry of the underlying unlabelled tree. The result then 
follows from Lemma [7. Ill Remark 17.121 and Lemma [7. 141 as above. □ 
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A special case of this result was proved in [2J Prop. 6.2]. 



8. Induction on to- Angulations of Polygons 



As we have seen in Section[31 the set of labelled TO-edge-coloured trees with k ver- 
tices is in bijection with the set of m-gon-labelled diagonal-coloured TO-angulations 
of a polygon P n with n — (m — 2)k + 2 sides up to rotation (recall that m-gon- 
labelled means that the TO-gons in the TO-angulation are labelled with the num- 
bers 1,2, ... ,k). The colours on the diagonals bounding each TO-gon in the TO- 
angulations are the symbols Si, ... , S m in a clockwise order around the boundary 
of the rn-gon. Similarly, the set of TO-edge-coloured trees with k vertices is in bi- 
jection with the set of diagonal-coloured TO-angulations of P n with k m-gons up to 
rotation. 

Our aim in this section is to rewrite induction in the language of (to — 2)-clusters. 
We saw in Theorem l4.1l that the set of TO-angulations of P n containing k TO-gons is in 
bijection with the set of (to — 2)-clusters in the sense of Fomin-Reading [TU] of type 
Ak-i. For such clusters, mutation corresponds in the TO-angulation to rotating a 
diagonal one step anticlockwise in the subpolygon with 2to — 2 sides obtained when 
the diagonal is removed. Such mutation is also well-defined on an m-angulation of 
P n up to rotation. We shall see that induction has a description as a composition 
of such mutations (which we shall refer to as anticlockwise diagonal rotations): see 
Propositions 18.51 and 18.61 below. 

Lemma 8.1. Every m-angulation A4 of a polygon has at least two m-gons with 
m — 1 boundary edges or is an m-angulation of an m-gon. 

Proof. The result is clearly true if Ai is an m-angulation of an rn-gon. Let A4 be 
an m-angulation of P n , and assume that all m-angulations of polygons with fewer 
sides have two boundary m-gons or are just one m-gon. Cutting Ai along one of 
its m-diagonals D gives two m-angulations of polygons with fewer sides. By the 
induction hypothesis, each of these polygons contains an m-gon incident with its 
boundary and the result follows. □ 

In the following two lemmas we describe how to use anticlockwise rotations of 
diagonals to rotate an m-angulation of P n (containing k m-gons) anticlockwise 
through 2ix jn. Note that any clockwise rotation can easily be achieved via a com- 
position of anticlockwise rotations; we shall therefore sometimes use clockwise ro- 
tations. We label the vertices of P n 1,2, ... ,n clockwise around the boundary, and 
use the notation [i, j] to describe a diagonal in the polygon connecting vertex i with 
vertex j . 

Lemma 8.2. Let M. be an m-angulation of P n by k m-gons with diagonals [l,m], 
[1, to+(to — 2)], [1, to+2(to — 2)], . . . , [1, TO+(fc — 2) (to — 2)] . Then there is an explicit 
sequence of diagonal rotations taking M to its rotation through 2ir/n anticlockwise. 

Proof. We apply the following anticlockwise diagonal rotations: 



[l,m+ (fc-2)(m-2)] [n,m + (k - 2)(m 

[1, m + (fc — 3)(m — 2)] — > [n, m + (k — 3)(to 



2) 
2) 



1] 
1] 



[1, m + r(m — 2)] 



— > [n, m + r(m — 2) 



1] 



[l,m] 



— > [n, m — 1] 
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This produces an m-angulation M 1 of P n with diagonals [n,m + (k — 2)(m — 
2) — 1], [ri, m + (k — 3)(m — 2) — 1], . . . , [n, m + r(m — 2) — 1], . . . , [n, to — 1] which 
corresponds to an anticlockwise rotation of M through 2n/n. □ 

Lemma 8.3. Let M be an m-angulation of P n containing k m-gons. Then there 
is an explicit sequence of diagonal rotations taking A4 to its rotation through 2n/n 
anticlockwise. 

Proof. Suppose P n has an m-angulation M by k m-gons. Let M be an m-gon 
with m — 1 boundary edges and one internal edge e joining vertices [i, i + (m — 
1)]. Let R be the union of the m-gons incident with i. Apply Lemma 18.21 to 
the induced m-angulation of R to rotate it one step anticlockwise. Let R 1 be 
the subpolygon R with the m-gon M' with vertices i — + 1, . . . , i + (m — 
1) — 1 removed. Apply the reverse sequence to the one described in Lemma 18.21 
to rotate the m-angulation of R' one step clockwise (recalling that a clockwise 
rotation of a diagonal coincides with a composition of anticlockwise rotations of 
the same diagonal). Consider the polygon Q which is given by removing the m-gon 
M' from P n and transform it (using an orientation-preserving homeomorphism) 
to a regular n — (m — 2)-gon P n —(m-2)- Since P n —(m-2) nas fewer sides than 
P n we have inductively constructed a sequence of anticlockwise diagonal rotations 
rotating the m-angulation of P n -( m -i) anticlockwise through lix j(n — (m — 1)). 
Applying the corresponding sequence to the m-angulation of Q takes M. to its 
rotation anticlockwise through 2ir/n as required. □ 

Definition 8.4. Let M be an m-angulation of P n , diagonal-coloured with symbols 
S\ , S2 , ■ ■ ■ , S m . If there is at least one internal diagonal in M and all of the internal 
diagonals of M. are coloured only with Si or Sj for fixed i,j, we call A4 a snake 
m-angulation. A subpolygon of P n with this property is called a snake subpolygon. 
Note that in any snake subpolygon the internal diagonals must be of the form 
[11,12], [i2>*3]) an d so on, and the internal angle between diagonals [i r _i,i r ] and 
[i r , i r +i] must alternate between being positive and negative as r increases. 

We note that such snake m-angulations appeared in the context of cluster al- 
gebras in Fomin-Zelevinsky's article [HI Sect. 3.5] (for the case m = 3, i.e. trian- 
gulations) and appeared for general m in [lUl Sect. 5.1] under the name m-snake. 
We now show how they can be used to describe Ri induction as a sequence of 
(m — 2)-cluster mutations, i.e. anticlockwise diagonal rotations. 

Firstly, we consider induction on labelled m-edge-coloured trees with k vertices. 
The set of such trees is in bijection with the set of diagonal-coloured m-angulations 
of P n up to rotation. 

Induction on diagonal-coloured m-angulations. 

Let M. be a diagonal-coloured m-angulation of P n and let 1 < i < m. Choose a 
maximal snake subpolygon B of M. with internal diagonals coloured Si or Si+i. 
Let Tlf{M) be defined using the following procedure. 

Step 1: It is easy to see that the induced m-angulation of B contains exactly 
two m-gons with to — 1 boundary edges in B. 

We fix Mi to be one such m-gon, and take e to be the internal edge of Mi in 
B. Let Mi be the unique m-gon adjacent to Mi, M 3 the unique m-gon adjacent to 
M2, and so on, with Mi the unique m-gon in B adjacent to M;_i for each I. 

If e has colour S%, rotate the diagonal between M 2 and M 3 one step anticlockwise, 
then rotate the diagonal between M4 and M5 one step anticlockwise and continue 
like this until there are or 1 m-gons left in B. 

If e is coloured 5»+i, rotate the diagonals between Mi and M 2 , M 3 and M 4 , etc. 
one step anticlockwise until there are or 1 m-gons left in B. 
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Exchange the colours of the edges coloured with Si and Sj+i in B and recolour 
the boundary edges in B as required, using the rule that each m-gon in B must 
have the symbols Si , S% , . . . , S m clockwise on its boundary. Note that this only 
changes colours on the boundary edges of m-gons M in B with a single internal 
edge in B which must be coloured Si. Connected components of the complement 
of B in P n incident with such m-gons M are then (temporarily) recoloured so as to 
be compatible with these colours. 

Step 2 : For each connected component C of the complement of B in P n incident 
with an m-gon M inside B with m — 1 boundary edges in B with internal edge e in 
B coloured with Sj+i (i.e. recoloured from Si in the above step), let D = C U M. 
Apply Lemma 18.31 to D to get a sequence of anticlockwise rotations rotating the 
induced m-angulation of D anticlockwise. Colour the diagonals of D using the 
colour Si on the edge e and the clockwise rule for labelling m-gons. Note that this 
step has the same effect as detaching C from an edge coloured Sj, j i, i + 1, of M 
and reattaching it after Step 1 to a boundary edge of M with the same symbol Sj 
(one step anticlockwise around the boundary of M), keeping the original labelling 
of C (i.e. from before Step 1). Such a boundary edge of M always exists, since 
j ^ i, i + 1. Should no such component C exist, no action is necessary. 

It is easy to see that the above procedure does not depend on the initial choice 
of M as m-gon of B with m — 1 boundary edges. Note also that the procedure 
commutes with any rotational symmetry of P n and so gives a well defined induction 
on a diagonal-coloured m-angulation of P n up to rotation which we denote the same 
way. 

Comparing the definitions of Rf and Tlf it is easy to check that: 

Proposition 8.5. Suppose that 1 < i < m and let G be an m- edge- coloured tree 
with k vertices containing a maximal Si — Si+i chain B. Let M. be the corresponding 
diagonal- coloured m-angulation of P n up to rotation with maximal snake subpolygon 
B corresponding to B. Then Rf{G) corresponds to Rf(A4). □ 

For a simple example of Proposition [531 see Figure IT2"1 

Finally, we show how the above can be modified to the case of a labelled m- 
edge-coloured tree with k vertices. We thus define induction on m-gon-labelled, 
diagonal-coloured m-angulations of P n up to rotation. We also reduce the number 
of anticlockwise diagonal rotations required by allowing detaching and reattaching 
of subpolygons (see Step 2 above). 

Induction on m-gon-labelled diagonal-coloured m-angulations. 

Let A4 be an m-gon-labelled, diagonal-coloured m-angulation of P n and let 1 < i < 
m. Choose a maximal snake subpolygon B of A4 with internal diagonals coloured 
Si or S;+i. Let TZf(M) be defined using the following procedure. 

Detach all subpolygons in the complement of B in P n . We know that the induced 
m-angulation of B has exactly two m-gons with m — 1 boundary edges in B. Let 
M\ be one of them with internal edge e. Let M 2 be the unique m-gon adjacent to 
Mi, M 3 the unique m-gon adjacent to M 2 , and so on. 

If e is coloured Si, rotate the diagonal between M 2 and M 3 one step anticlockwise. 
Let M' 2 and M' z be the new m-gons created, with Mg adjacent to Mi. Repeat for 
M4 and M5, Mq and M7, etc until or 1 m-gons are left. 

If e is coloured Sj+i, rotate the diagonal between Mi and M 2 one step anticlock- 
wise; call the new m-gons created M[ and M 2 with M' 2 incident with a boundary 
edge of M\, Repeat for M3 and M4, etc. until or 1 m-gons are left. 

Recolour all edges in B, starting by exchanging colours Si and Sj+i on the 
diagonals of B and then using the rule that every m-gon inside B must have the 
symbols Si , . . . , S m clockwise on its boundary. Reattach each detached subpolygon 
(keeping its original diagonal labelling) to the m-gon with the same label it was 
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Figure 12. Mutations giving rise to R3 induction (see Proposi- 
tion [53]). The snake subpolygon is shaded. 



originally attached to via the edge coloured with the same symbol. The new m-gon 
label of M[ is defined to be the old m-gon label of M, ; all other m-gons retain the 
same m-gon label. 

It is easy to see that, as in the previous case, the above procedure does not 
depend on the initial choice of M as m-gon of B with m — 1 boundary edges. 
Also, as before, the above procedure commutes with any rotational symmetry of 
P n and so gives a well defined induction on an m-gon-labelled diagonal-coloured m- 
angulation of P n up to rotation which we denote the same way. As in the previous 
case, a comparison of the definitions of Rf and IZf yields: 

Proposition 8.6. Suppose that 1 < i < m and let G be a labelled m- edge- coloured 
tree with k vertices containing a maximal Si — Si+i chain B. Let Ai be the corre- 
sponding m-gon-labelled diagonal- coloured m- angulation of P n up to rotation with 
maximal snake subpolygon B corresponding to B. Then Rf{G) corresponds to 
Rf(M). □ 

Remark 8.7. For Lj induction (in both cases), it is just necessary to replace all 
anticlockwise rotations by clockwise rotations and vice versa. 
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